Optimizing Performance under Thermal and Power

Author

  • OSMAN SAROOD
Abstract

Energy, power, and resilience are the major challenges the HPC community faces in moving to larger supercomputers. Data centers worldwide consumed 235 billion kWh of energy in 2010, and a significant portion of that energy and power consumption is devoted to cooling. This thesis proposes a scheme that combines limiting processor temperatures through Dynamic Voltage and Frequency Scaling (DVFS) with frequency-aware load balancing to reduce cooling energy consumption and prevent hot spot formation.

Recent reports have expressed concern that reliability at the exascale level could degrade to the point where failures become the norm rather than the exception. HPC researchers are improving existing fault tolerance protocols to address these concerns, while research on improving hardware reliability has been progressing independently. The second component of this thesis tries to bridge this gap by exploring the potential of combining software and hardware aspects to improve the reliability of HPC machines.

Finally, the 10 MW power consumption of present-day HPC systems is becoming a bottleneck. Although energy bills will increase significantly with machine size, power consumption is a hard constraint that must be addressed. Intel's Running Average Power Limit (RAPL) toolkit is a recent feature that enables power capping of the CPU and memory subsystems on modern hardware. The ability to constrain the maximum power consumption of these subsystems below the vendor-assigned Thermal Design Point (TDP) value allows more nodes to be added to an overprovisioned system while ensuring that the total power consumption of the data center does not exceed its power budget. The final component of this thesis proposes an interpolation scheme that uses an application profile to optimize the number of nodes and the distribution of power between the CPU and memory subsystems so as to minimize execution time under a strict power budget. We also present a resource management scheme, including a scheduler, that uses CPU power capping, hardware overprovisioning, and job malleability to improve the throughput of a data center under a strict power budget.
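
The trade-off behind overprovisioning can be illustrated with a small sketch: under a fixed power budget, running more nodes at lower CPU and memory caps may finish sooner than running fewer nodes at full power. The code below is only an illustration under stated assumptions. It uses a brute-force search over a made-up timing model (`toy_time`), not the interpolation scheme or application profiles of the thesis, and the wattage constants and function names are hypothetical.

```python
from itertools import product

CPU_TDP_W = 95.0   # assumed per-node CPU power at full speed (illustrative)
MEM_MAX_W = 35.0   # assumed per-node memory power at full speed (illustrative)


def toy_time(nodes, cpu_w, mem_w):
    """Hypothetical execution time in seconds; a stand-in for a measured profile.

    Speed grows sublinearly with the power cap, so lowering a cap costs
    less performance than the power it frees up.
    """
    cpu_speed = (cpu_w / CPU_TDP_W) ** 0.5
    mem_speed = (mem_w / MEM_MAX_W) ** 0.5
    # The slower subsystem limits each node; work divides evenly across nodes.
    return 1000.0 / (nodes * min(cpu_speed, mem_speed))


def best_config(budget_w, max_nodes, cpu_caps, mem_caps, time_fn=toy_time):
    """Return (time, nodes, cpu_cap, mem_cap) with the lowest time within budget.

    Returns None when no configuration fits the budget.
    """
    best = None
    for nodes, cpu_w, mem_w in product(range(1, max_nodes + 1), cpu_caps, mem_caps):
        if nodes * (cpu_w + mem_w) > budget_w:
            continue  # this configuration would exceed the power budget
        t = time_fn(nodes, cpu_w, mem_w)
        if best is None or t < best[0]:
            best = (t, nodes, cpu_w, mem_w)
    return best


if __name__ == "__main__":
    cpu_caps = [45.0, 55.0, 65.0, 75.0, 85.0, 95.0]
    mem_caps = [15.0, 20.0, 25.0, 30.0, 35.0]
    print(best_config(1300.0, 40, cpu_caps, mem_caps))
```

With a 1300 W budget, only 10 nodes fit at the assumed full per-node power of 130 W. In this toy model the search prefers a larger number of capped nodes, which is the effect the abstract attributes to overprovisioning. A real study would replace `toy_time` with measured or interpolated application profiles.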

Similar resources

Performance Modeling of Power Generation System of a Thermal Plant

The present paper discusses the development of a performance model of the power generation system of a thermal plant for performance evaluation, using a Markov technique and a probabilistic approach. The study covers two areas: development of a predictive model and evaluation of performance with the help of the developed model. The system of the thermal plant under study consists of four subsystems with...

Numerical Study for Optimizing Parameters of High-Intensity Focused Ultrasound-Induced Thermal Field during Liver Tumor Ablation: HIFU Simulator

Introduction: High-intensity focused ultrasound (HIFU) is considered a noninvasive and effective technique for tumor ablation. Frequency and acoustic power are the most influential parameters for temperature distribution and the extent of tissue damage. The aim of this study was to optimize the operating transducer parameters, such as frequency and input power, in order to acquire suitable temperatu...

The Optimal Design of Heat Sinks: A Review

Heat sinks are used in industrial equipment to dissipate excess heat from heat-generating parts to the ambient. In the last few years, considerable effort has gone into manufacturing electronic and mechanical devices with less weight, less space, and lower cost. Heat dissipation from the heat sink is still a big problem that many researchers are trying to solve. The aim of this study is to brief th...

Place Finding and Optimizing the Determination of Production Units Dynamically for Providing the Electricity and Heat in Industrial City

In this article, the place and capacity of a combined heat and power (CHP) production unit were determined dynamically using modified particle swarm optimization (MPSO). This was done by optimizing the place and capacity of CHP as a production resource, with the aim of increasing reliability, decreasing losses, and providing the electrical and thermal energy of an industrial city. The f...

A Markov Model for Performance Evaluation of Coal Handling Unit of a Thermal Power Plant

The present paper discusses the development of a Markov model for performance evaluation of the coal handling unit of a thermal power plant using a probabilistic approach. The coal handling unit ensures a proper supply of coal for the sound functioning of the thermal power plant. In the present paper, the coal handling unit consists of two subsystems with two possible states, i.e., working and failed. Failure and repair...

Green Productivity in Iran's Thermal Power Plants: The Malmquist-Luenberger Approach

Electricity generation in thermal power plants, the largest producers of electricity in Iran, is associated with greenhouse gas emissions. In this paper, the Malmquist-Luenberger method is used to measure green productivity and efficiency changes for 31 thermal power plants (12 steam power plants, 13 gas power plants, and six combined cycle power plants) during 2009-2016. The result...

Publication date: 2014